Mandarin Chinese tone nucleus detection with landmarks
نویسندگان
چکیده
This paper discusses a new approach to improve tone recognition by modeling the tone nucleus with vowel landmark detection. The tone nucleus region is identified based on vowel landmark frames derived by an automatic landmark recognition system. In the corresponding tone recognition experiments, the best results with landmark-based tone nucleus regions outperform the best baseline system results by more than 6%. Moreover, in an exploratory experiment, the tone recognition accuracy using tone nucleus regions based only on vowel landmark evidence shows less than 2% degradation relative to the accuracy obtained using both landmark frames and force-aligned vowel boundary information. These findings further demonstrate the potential to perform tone recognition based on landmark detection alone, without full speech recognition or aligned transcriptions.
منابع مشابه
Tone perception of Mandarin-speaking postlingually deaf implantees using the Nucleus 22-Channel Cochlear Mini System.
Because Mandarin Chinese is a tonal language, testing the patient's ability to distinguish among four tones is of paramount importance. This paper evaluates the efficacy of the Nucleus 22-Channel Mini System for Mandarin Chinese by comparing the postoperative tone perception test results with the results of the closed-set monosyllable, trochee, and spondee (MTS) test, and the open-set phonetica...
متن کاملA Pitch Smoothing Method for Mandarin Tone Recognition
Mandarin Chinese is known as a tonal language with four lexical tones. Tone recognition plays an important role in automatic Chinese speech recognition in that the same syllable with different tones gives quite distinct meanings. The different tone can be characterized by its pitch contour, but the pitch contours are hardly ideal smooth curves. It is because the pitch points calculated by pitch...
متن کاملSubsyllabic Tone Units for Reducing Physiological Effects in Automatic Tone Recognition for Connected Mandarin
This paper presents our attempt to model physiological transition effect on syllable F0 contour in order to improve lexical tone recognition performance for Mandarin Chinese. We suggested that a syllable F0 contour consists of three segments: onset course, tone nucleus and offset course. Among the three segments, only tone nucleus contains key features for tone recognition, and the other two re...
متن کاملIncorporating Pitch Features for Tone Modeling in Automatic Recognition of Mandarin Chinese
Tone plays a fundamental role in Mandarin Chinese, as it plays a lexical role in determining the meanings of words in spoken Mandarin. For example, these two sentences R R (I like horses) and R M (I like to scold) differ only in the tone carried by the last syllable. Thus, the inclusion of tone-related information through analysis of pitch data should improve the performance of automatic speech...
متن کاملA Perception Study on the Third Tone in Mandarin Chinese
This experimental study examines the role of the shape of the pitch contour in the perception of the Mandarin Chinese tone 3. A set of stimuli was constructed by varying the pitch of tone 3 on two conditions: (1) varying the duration of the dip (or turning point) and (2) varying the timing of the turning point (duration of the slope). The manipulated pitch contours of tone 3 were presented to t...
متن کامل